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A hard-deadline, opportunistic scheduling problem in which B bits must be transmitted within T time-slots 
over a time-varying channel is studied: the transmitter must decide how many bits to serve in each slot based 
on knowledge of the current channel but without knowledge of the channel in future slots, with the objective of 
minimizing expected transmission energy. In order to focus on the effects of delay and fading, we assume that no 
other packets are scheduled simultaneously and no outage is considered. We also assume that the scheduler can 
transmit at capacity where the underlying noise channel is Gaussian such that the energy-bit relation is a Shannon- 
type exponential function. No closed form solution for the optimal policy is known for this problem, which is 
naturally formulated as a finite-horizon dynamic program, but three different policies are shown to be optimal in the 
limiting regimes where T is fixed and B is large, T is fixed and B is small, and where B and T are simultaneously 
taken to infinity. In addition, the advantage of optimal scheduling is quantified relative to a non-opportunistic (i.e., 
^ ' channel-blind) equal-bit policy. 

I. Introduction 

Although the basic tenants of opportunistic communication over time- varying channels are well understood, much 
' less is known when short-term delay constraints are imposed. Given the increasing importance of delay constrained 
fV") • communication, e.g., multimedia transmission, it is critical to understand how to optimize communication perfor- 
in ■ mance in delay-limited settings. Thereby motivated, we consider the discrete-time causal scheduling problem of 
^sO • transmitting a packet of B bits within a hard deadline of T slots over a time-varying channel. At each time slot the 
^ . scheduler determines how many bits to transmit based on the current channel state information (CSI), but without 
q \ future CSI, and the number of unserved bits, with the objective of minimizing the expected total energy cost. In 
1 order to focus on the interplay between opportunistic communication and delay, it is assumed that no other packets 
are simultaneously transmitted, and the hard deadline must always be met. 

This basic problem was formulated as a finite-horizon dynamic program in [1], but an analytic form for the 
' optimal scheduling policy cannot be found for most energy-bit relationships. Indeed, such a problem is difficult to 
solve because the transmitter only has causal CSI and because a particular rate must be guaranteed over a finite 
time-horizon. In our earlier work [2], we studied this problem in the setting where transmission occurs at the 
capacity of the underlying Gaussian noise channel and proposed different suboptimal scheduling policies. 

Building upon [2], in this work we prove the optimality of certain scheduling policies in different asymptotic 
regimes. In particular, we show that: 

• When the number of bits B is large, the optimal scheduling policy is a linear combination of a delay-associated 
term and an opportunistic -term. The opportunistic term depends on the logarithm of the channel quality, and 
the weight of this term decreases as the deadline approaches. 

• When the number of bits B is small, a one-shot threshold policy where all B bits are transmitted in the first 
slot in which the channel quality is above a specified threshold is optimal. 

• When the number of bits B and the time horizon T are both large, a waterfilling-like policy is optimal. 
These results are particularly important in light of the fact that the general optimal solution appears intractable. In 

addition, the different asymptotically optimal schedulers provide an understanding of how the conflicting objectives 



X 



The work of J. Lee is supported by a Motorola Partnership in Research Grant. 



i_ _y °' 

wireless channel 



Fig. 1: Point-to-point delay constrained scheduling 



of opportunistic communication (i.e. transmit only when the channel is strong) and delay-limited communication 
are optimally balanced, and how this balance depends on the time -horizon and the packet size. 

In addition to showing asymptotic optimality, we also quantify the power benefits of optimal channel- and delay- 
aware scheduling relative to non-opportunistic equal-bit/rate transmission. These results identify that the largest 
benefits are obtained for severe fading, small packet size, and large time horizon. Moreover, we analyze the behavior 
of the scheduling policies for large and small B using results on high and low SNR analysis in [3] and [4]. 

A. Prior Work 

The basic scheduling problem was first proposed and formulated as a finite-horizon dynamic program (DP) in 
[1]. In that work a closed-form solution for the optimal scheduler is provided for the special case where the number 
of transmitted bits is linear in the transmit energy/power and the channel quality is restricted to integer multiples 
of some constant. In [5], the formulation is extended to continuous time; closed-form descriptions of the optimal 
policies for some specific models are found, but these do not directly apply to the discrete-time problem considered 
here. In our earlier work [2], we specialized [1] to the setting where the energy -bit relationship is dictated by 
AWGN channel capacity and proposed several different suboptimal policies. Two of these policies are shown to be 
asymptotically optimal in the present work. 

Prior work has also considered the dual problem of (expected) rate maximization over a finite time horizon, i.e., 
the transmitter determines how to utilize a finite energy budget over a finite number of slots with the objective 
of maximizing the expected rate. This problem was considered in [6], and a one-shot threshold policy and equal 
power scheduling are shown to be asymptotically optimal in the low- and high-SNR regimes, respectively. This 
work was extended to a multiple-access setting in [7]. 

Because transmission scheduling corresponds to power allocation, it is also useful to put the present work in 
the context of prior work on optimal power allocation in fading channels, with and without delay constraints. In 
[8] it is established that waterfilling maximizes the long-term average transmitted rate; analogously, the long-term 
average power needed to achieve a particular long-term average rate is minimized by waterfilling. At the other 
extreme, channel inversion is known to be the optimal policy when a constant rate is desired in every fading state 
[9]. The current setting lies between these two extremes, because our objective is to find a power allocation policy 
(based on causal CSI) such that a particular rate (i.e. B/T) is guaranteed over T fading slots. The case T = 1 
clearly corresponds to zero-outage/delay-limited capacity in [9], while we intuitively expect T — > oo to correspond 
to the long-term average rate scenario of [8]. The latter correspondence is made precise in Section HV-CI 

II. Problem Setup 

This section summarizes the scheduling problem introduced in [2], which is a discrete-time delay constrained 
scheduling problem over a wireless fading channel as illustrated in Fig. Q] A packet of B bitsEI is to be transmitted 
within a deadline of T slots. The scheduler determines the number of bits to allocate at each time slot using 
the fading realization/statistics to minimize the total expected transmit energy while satisfying the delay deadline 
constraint. We assume no other packets are to be scheduled simultaneously and that no outage is allowed. 

The discrete-time slots are indexed by t in descending order (i.e., starting at t = T down to t = 1), and thus t 
represents the number of remaining slots to the deadline. The channel state (at slot t) is denoted by g t in power 
units. We assume that gr,gr-i," ' >9l w& independently and identically distributed (i.i.d.) and the probability 
density function (PDF) and the cumulative distribution function (CDF) are denoted by / and F, respectiveljo 

'We operate in "nats" instead of "bits" since we adopt log-base e expression in the capacity formula to avoid constant factors in the 
analysis. We use "bits" and "bit allocation" as generic terms. 

2 The fading distribution must have a non-zero delay-limited capacity, i.e., E[l/fl] < oo, for this problem to be feasible. 
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The scheduler is assumed to have only causal knowledge of channel states (at time t, gr, • • • ,gt are known but 
gt-i, • • • ,gi are unknown). Assuming unit variance Gaussian additive noise and transmission at capacity, if energy 
E t is used under channel state g t , the number of transmitted bits is given by: 

b t = log(l+g t E t ) (1) 

By inverting this formula, the required energy E t to transmit bt bits with channel state g t is: 

e bt - 1 

Et(b t ,g t ) = . (2) 

9t 

The queue state is denoted by j3 t , which is the number of unserved bits at the beginning of slot t. Thus, 
the number of bits to allocate at slot t is determined by the queue state fit and the channel state g t . That is, 
a scheduling policy is a sequence of functions, indexed by the time step, that map from the current queue and 
channel state to the bit allocation: {br(Pr, <7t)> &T-l(/3r-l> <7t-i)> ■ ■ ■ , &l gi)}- As for terminology, the entire 
set {&t( - > ■)> br-i(:, •)> " " , b\(-, •)} is referred to as a policy or a scheduler, and each element of it is referred to 
as a policy function or a scheduling function. 

III. Optimal & Suboptimal Schedulers 

In this section we describe the optimal scheduling policy, two suboptimal policies introduced in [2], and a 
heuristic modification of the ergodic (infinite-horizon) policy. 



A. The Optimal Scheduler 

The optimal scheduler for the hard-deadlined causal scheduling problem described in Section HH can be found by 
solving the sequential optimization: 



b?\Pu9t) 



arg min < E t (b t ,g t ) + E 

0<b t </3 t 



ih 



t-l 

E s (b s ,g s 

s=l 



bt 



t = T,...,2, 



t = 1. 



(3) 



where E denotes the expectation operator. Equivalently, this can be formulated as a finite-horizon dynamic program 
(DP): 

ebt ~ l +Jtl\(Pt - b, 



mm 

0<6 t </3 t 

9i ' 



9t 



t > 2 
t = l, 



(4) 



where J°^_\(fi) = ~E g [J°^i((3 , g)] is the cost-to-go function, i.e., the expected cost to serve (3 bits in t — 1 slots if 
the optimal policy is used. 

At the final step (t = 1) all f3\ remaining bits must be served because outage is not allowed. At all other steps the 
optimal bit allocation is determined by balancing the current energy cost e *~ and the expected energy expenditure 
in future slots J°Z'i(A — bt). Although the optimal scheduler can be found in closed form for T = 2 (Section III- A 
in [2]), it is not possible to do the same for T > 2 because no close-form expression for the cost-to-go function is 
known for T > 2. Nevertheless, the optimal scheduling functions can be described as [2]: 



bT(Pt,9t) 



0, 



gt < 



{i = iJT\)'^-b)}, w ^ W) <g t 



< 



(J7-i)'(o)' 



(5) 



gt > 



(^i)'(O)' 



where arg 6 {-} represents the solutiorEI of the argument equation. The differentiability of J°Z\ can be verified by 
the properties of convexity and infimal convolution (pp. 254-255 in [10]). 

Proposition 1: The optimal policy function 6° pt (/3t,^t) has the following monotonicity properties: 



3 Because of the convexity, the solution exists uniquely if it exists. 
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(a) For any fixed value g t (> 0), 6° pt and (0 t — 6° pt ) are non-decreasing in t . Furthermore, there exists 53o such 
that 6° pt and (f3 t — b° pt ) are strictly increasing in (3 t for all 0t > ^o- 

(b) For any fixed value /?t(> 0), 6° pt is non-decreasing in g t . 

Proof: See Appendix J] ■ 

Intuitively, monotonicity in the queue 0t and the channel state g t is expected because more bits should be served 
when there remain more unserved bits or when the channel is strong. 



B. The Boundary -relaxed Scheduler 

The first suboptimal scheduler is derived by relaxing the boundary constraints (we no longer require < b t < (3t), 
while maintaining the deadline constraint Ylt=i bt = B. The relaxed version of the original optimization (@]) is 
given by 




+ U t - 1 (B t -b t ) , t>2, 
U t (f3t,9t) = < fc > 9t J ~ (6) 



t = l, 



where TJt-\{0) = E ff [J7t-i(/?, g)} and can be calculated by induction [2]: 

U t {p) = te~ G(u t , v t -x, ■■■ , v{) - tvi, (7) 
where G denotes the geometric mean operator (i.e., &(x\,--- ,x n ) = (11^=1 x k) l ^ n ) and ux,^,-" i v t are tne 



fractional moments of the fading distribution defined as: 

m 



E, 



m = l,2,---. (8) 



Due to the simple form of the cost-to-go function Ut, by substituting (fTJ) into and solving the minimization 
we obtain the following closed-form description of the optimal policy for the relaxed problem [2]: 

where ^ elax serves as a channel threshold given by 

1 



bt(Pt, gt) = - $t + — log ( ) (9) 



^relax = ; ^_ (1Q) 

G(^-l,^-2, • • • 

The policy function in (O solves the boundary-relaxed problem but does not guarantee < bt < fit in each slot. 

To obtain a policy for the actual unrelaxed problem, we simply truncate at and /3 t , and reach what we refer to 
as the boundary-relaxed schedule^: 

6f lax (A,^) = (^ + ^log^)^ (ID 

where (-)g denotes truncation below and above 0f Notice that this policy function is optimal for t = 2, i.e., 
bf™ = b° 2 pt for all 02 and g 2 since (Ut)' = (J° pt )'. 

Note that this same scheduling policy can be reached using the high-SNR approximation log(l + x) w log(x). 
More specifically, if the energy-bit relationship in (f2]) is approximated by: 

e bt - 1 e bt 

Et(b t ,g t ) = - «— • (12) 

9t 9t 

and the optimal policy is found with the same relaxation as above, the policy in Q also reached. 
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This is referred to as the suboptimal II scheduler in [2]. 
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C. The One-shot Scheduler 

The second scheduler is derived by modifying the boundary constraint into a stronger constraint bt G {0,Pt} 
(equivalently, b t G {0,B}), i.e., in each slot either the entire packet is transmitted or nothing is transmitted. Then, 
the dynamic program is given by 



mm 



9i 



t = h 



(13) 



where J° ne (/3) = E ff [J° ne (/3, g)]. Equivalently, we can express the above DP as an optimal stopping problem [11] 
(this can be shown inductively with 0r = B): 



Jr(B,g t ) 



mm 

e fl -l 

- 9i 



J?-\(B)}, t>2, 



t = 1. 



The optimal solution is a sequential threshold policy [2]: 

f B, first t such that g t > 1/oJt, 
bt = < 

I 0, otherwise, 

where 1/ut is the channel threshold in slot t, and is recursively computed as: 



(14) 



(15) 



E 
E 
{oo, 



mm ( ~,Ut-l 



t = T, ■ ■ ■ , 3, 
t = 2, 
t = 1. 



(16) 



Notice that the thresholds depend only on the channel statistics and are independent of B, and that the thresholds 
decrease as the deadline approaches (i.e., as t decreases) [2]. 



D. The Delay-constrained Ergodic Scheduler 

The above two suboptimal policies are developed to solve the DP, formulated in (0]), by simplifying the cost-to-go 
function. Unlike these two policies, we now consider a policy by modifying the ergodic scheduling policy to meet 
the hard deadline constraint. The ergodic policy is the optimal solution to a problem of minimizing the average 
energy to transmit a certain average number of bits (i.e., no hard deadline constraint). If we denote this average 
rate constraint as b, the ergodic scheduling policy function b(g), which does not depend on t and determines how 
many bits to transmit based only upon the channel state g, is determined by solving: 



E STS (b) 



mm 

Kg) 



E 



ob(g) 



(17) 



subject to E g [b(g)} > b, b(g) > 0. 



This optimization is readily solvable by standard waterfilling [12] and the solution is given by 



b^(b,g) 



log I 

1 ^erg 



log(#i 



g > v erg , 

else, 



(18) 



o 10, 

where rf tg serves as a channel threshold and is the solution to: 

E[b^(b,g)]=b. (19) 
When the time-horizon T is large, we intuitively expect the ergodic policy to perform well in the delay-limited 

B 

T 



setting considered here. In order to meet the deadline constraint, we utilize the ergodic policy, with b = ^ + 5 for 
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some 5 > oJl at each time step with the exception that all remaining unserved bits are transmitted in the final step: 

,constrained-erg ( B \ I 6 elg (S, + (5, (ft) , t = T, T — 1, • • • , 2, 

lr^ a J = ( A , <20) 

which is refeiTed to as the delay-constrained ergodic scheduler. 

IV. Asymptotic Optimality 

This section investigates the optimality of the suboptimal schedulers introduced in the previous section. The 
optimality can be analyzed in two ways: optimality in policy and optimality in the associated energy cost. Both 
forms of optimality are shown for the boundary-relaxed scheduler and the one-shot scheduler, whereas energy 
optimality is shown for the delay-constrained ergodic scheduler. 

A. Large B and Finite T: Asymptotic Optimality of Boundary-relaxed Scheduler 

We first prove that the boundary-relaxed scheduler converges to the optimal policy when T is fixed and the 
number of bits B is taken to infinity. When B is large, we intuitively expect that the optimal policy will allocate 
strictly positive bits to all T time slots with high probability due to the nature of the Shannon energy-bit function. 
Thus, we expect the boundary-relaxed scheduler to coincide with the optimal policy when the number of bits to 
serve is large. The following theorem makes this relationship precise: 

Theorem 1: Let the PDF / of g t be continuous on [g min ,g mSLX ] with Support (/) = [g min , g max ], where g min > 
and gmax < oo. For every time step t, the boundary-relaxed policy function in (fTTT> converges to the optimal 
scheduling policy function uniformly on [g m - m , g max ] as the number of unserved bits (3 goes to infinity: for every 
given e > 0, there exists 5So suc h that 

K ax (P,9t) ~ b?(P,9t)\ < e, V 5t G [fc^ftnaj- (21) 

for (3 > <B . 

Proof: See Appendix HT1 ■ 

Figure|2^ illustrates the behaviors of b^ lax (l3, 53) and b^ pt (P, g%) vs. #3 for different values of f3 and Fig. illustrates 
the behaviors in terms of j3 for different values of 53, when g is a truncated exponential variable with a support 
of [0.001, 10 6 ] (the pdf is given in (l30l)). When 53 = 0.5, for instance, it can be seen that the difference between 
63 lax and 6g Pt gets smaller as (3 increases in both Fig. [2^ and Fig. [2J5. Notice also that the value of (3 making the 
difference between 6 3 elax and 63" small varies with the value of 53. As can be seen in Fig. [2}), larger (3 is required 
for larger 53. Additionally, we can observe from Fig. [2j) that the slope of the plots is 1 in small (3 and the slope 
changes to | for some larger (3 depending on the value of g%, which is due to the policy function in (fTTb . 

We now compare the incurred energy costs of the two polices. We first define the incurred energy with the 
boundary -relaxed scheduler as: 



, relax _ 

+ J£? (A - 6f lax ), t = T, T - 1, • • • , 2, 



Jf^{(3 ugt ) = l ^ '"t-iv« "» » - >~> (22) 

where Jj!l ax (/?) = KgiJ^fiP, g)]- Notice that (1221 is not an optimization but is instead a calculation based upon 
the definition of bf lax in (fTTI ). Also notice that JJ elax denotes the cost for the actual un-relaxed problem (the energy 
cost with a policy satisfying < b t < (3t for all t), while the function U t defined in Section IIII-BI denotes the cost 
for the relaxed problem (the energy cost with a policy that may not satisfy < b t < 0t). 



5 This policy is motivated by Theorem 3 of [7], where a modified version of the ergodic rate-maximizing policy is shown to maximize 
the expected transmitted rate over a finite time-horizon when the transmitter is subject to a finite energy constraint (which is the dual of the 
problem considered here). 
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Theorem 2: Let the PDF / of g t be continuous on [g min , g mSLX ] with Support(/) = [g min , g max ], where g min > 
and g max < oo. For any number of time slots T, the energy cost of the boundary-relaxed scheduler converges to 
the optimal energy cost as the number of bits B goes to infinity: 

lim \jf ax (B) - J^\B)] = 0. (23) 

B — >oo 

Proof: See Appendix ITTT1 ■ 

While proving Theorem [2j we obtain the asymptotic relations between the actual cost of the boundary-relaxed 
scheduler, the cost of the relaxed version, and the cost of the optimal one, i.e., limB->oo [J^ lax (B) — Ut(B)] = 
and lims^oo [U T {B) - J^\B)] = 0. Since we have a closed-form expression of Ut(B) shown in ©, these 
relations help us understand the behavior of the optimal cost for large B, which will be discussed in Section IV-AI 
Although the analytic form of the optimal scheduler is not available, the above two theorems tell us that the 
boundary-relaxed scheduler, which has a very simple form that can be easily implemented, is asymptotically optimal 
when the number of bits to transmit (B) is sufficiently large. Furthermore, the scheduling function (ITTb provides 
intuition on the interplay between the channel quality and the deadline. When the deadline is far away (large t), the 
bit allocation is almost completely determined by the channel quality; on the other hand, as the deadline approaches 
(small t), the policy becomes less opportunistic. 



B. Small B and Finite T: Asymptotic Optimality of One-shot Scheduler 

We now show that the one-shot scheduling policy is asymptotically optimal when T is fixed and B is taken to 
zero. We first show convergence in terms of the policy function, and then in terms of the energy cost. 

Theorem 3: For arbitrary time step t, the one-shot policy function in (TT5l) converges to the optimal scheduling 
policy function as the number of unserved bits j3 tends to zero, i.e., the optimal policy becomes a threshold policy 
and the threshold coincides with the threshold of the one-shot policy: 

lim sup{ 5 : b° p \(3, g) = 0} = lim inf{<7 : g) = (3} = (24) 

where I /cut is the threshold of the one-shot policy as in (fT5T ) and ( fT6b . 

Proof: See Appendix IIVI ■ 
Furthermore, we claim that the costs of the two policies also converge to one another. Since the average costs 
for the two policies converge to zero as B — > 0, cost convergence is investigated by studying the ratio, rather than 
the absolute difference, between the two costs: 
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Fig. 3: Additional power cost of one-shot scheduling relative to optimal scheduling as a function of B, when g is 
a truncated exponential variable with support [0.001, 10 6 ] 



Theorem 4: For arbitrary delay deadline T, the energy cost of the one-shot scheduler converges to the optimal 
energy cost as the number of bits B goes to zero: 

'-(B) 



lim 



rone / 



B-0 X°/(B) 



1. 



(25) 



Proof: See Appendix IVl 



In Fig. [3] the additional power cost of one-shot scheduling relative to optimal scheduling (i.e., 10 log 



(B) 



'10 jfCB)") 1S 

plotted versus the number of bits B for T = 2 and T = 3 when g is a truncated exponential variable with a support 
of [0.001, 10 6 ]. As can be seen, the ratio converges to 1 (0 dB) as B converges to 0. 

The optimality of one-shot scheduling can also be seen by upper and lower bounding the energy-bit function by 
linear functions. Using x < e x — 1 < xe B for < x < B, we have: 



— < E t (b t ,g t ) < 
9t 



B 



9t 



(26) 



If we solve the DP using either of these bounds on the energy-bit function, the optimization in (01) becomes 
a linear program and thus a one-shot policy is optimal because a constrained linear program has a solution at a 
boundary of the constraint. Furthermore, the one-shot policy based on the upper and lower bounds converge to the 
one-shot policy described in Section IIII-CI as B — > because the bounds themselves converge. 



C. Large T: Asymptotic Optimality of Causal Delay-constrained Ergodic Scheduler 

When B and T are simultaneously taken to infinity at a particular ratio (i.e., B, T — > oo with B 
constant b > 0), we can show the energy-cost optimality of the ergodic policy in Section IIII-DI 
The average energy cost of the delay-constrained ergodic scheduler is given by 

^J 1 i constrained -( 



X 



constrained-erg 



T 



(bT- 5) = E 



E 

.t=i 



1 



9t 



E 



E 

.t=2 



1 



9t 





-11 


+ E 






. 91 . 



bT for some 



(27) 



where 0\ denotes the remaining bits at the final slot and the value of 5 is chosen such that 



X 



constrained-erg 



(bT) = inf X 



<5>0 



constrained-erg 
T 



(It- 5). 



(28) 
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Theorem 5: For any given average rate b(> 0), the per-slot energy cost of the delay-constrained ergodic policy 
converges to the optimal ergodic energy cost as T tends to infinity: 

lim Ij-n~d-erg^ T) = ^ Ijopt^ = (29) 

Proof: See Appendix IVII ■ 

The effect of the hard-deadline becomes inconsequential for large T because the channel realizations over the 
deadline horizon closely match the fading distribution. As a result, the delay-constrained ergodic scheduler performs 
similar to the ergodic scheduler when T is large. Moreover, the delay-constrained ergodic scheduler becomes causal 
optimal since any causal policy cannot be better than the ergodic policy. 

D. Numerical Results: Policy Comparison 

In order to compare the different asymptotically optimal policies, we compare their respective energy costs for 
different time-horizons (T). Since the analytical expression for the optimal policy is not available for T > 2, we 
solve the dynamic programming (0]) numerically by the discretization method [13]. In Fig. [4] the per-slot energy 
consumption of the suboptimal schedulers is plotted for T = 5 and T = 50 assuming that the fading {gt}f = i are 
i.i.d. truncated exponential with a support of [0.001, 10 ], i.e., 

f{g) = fce^-o.ooy if 001 < g < lo6) 
lo, otherwise, 

where c is a normalization factor. As can be seen, the one-shot scheduler is near-optimal only when B is small. 
The other schedulers performs close to the optimal through all ranges of B. When T = 5, as in Fig. [4^, the 
delay-constrained ergodic scheduler performs worse than the boundary-relaxed for all B. This is because T = 5 is 
too small for the delay-constrained ergodic scheduler to perform like the optimal. When T = 50, as in Fig. |4j), there 
exists a range of B such that the delay-constrained ergodic scheduler outperforms the boundary-relaxed scheduler. 
This phenomenon can be clearly illustrated in Fig. [5J where the number of bits are given in logarithmic scale. 
As can be seen in Fig. [51 the one-shot scheduler performs best for small B (region A) and the boundary-relaxed 
scheduler outperforms when B is very large (region C). In the middle range (region B), the delay-constrained 
ergodic scheduler performs better than the other two. 
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V. Scheduling Gain 

We have shown that the boundary-relaxed and the one-shot schedulers are asymptotically optimal as B — > oo 
and B — > 0, respectively. Another interesting issue is quantifying the advantage these schedulers provide compared 
to a non-opportunistic equal-bit scheduler that simply transmits B/T bits during each time slot. 

To compare energy performance, we first calculate the expected energy cost of the equal-bit scheduler, which is 

T 



T t ^\b)=e 



t=i m 



T \ e^v\ — ui), (31) 



since the equal-bit scheduler chooses bt = B/T for all t. Notice that the equal-bit scheduler achieves the delay- 
limited capacity [9] [14] (i.e., zero-outage capacity) with rate B/T. 

We define the scheduling gain as the ratio between the expected energy expenditures: 

/equal /gx 

A f(B) 4 1 W (32) 

and quantify its behavior in the following theorem: 

Theorem 6: For any T, the scheduling gain A°n Pt (i3) is monotonically decreasing with respect to B. Furthermore, 
the limiting scheduling gains are given by: 



J^\B) Vl 
i i. i j i __i , ■ i jd i = mil — — — = 



lim A° pt (B) = lim ? ; ' = (33) 



and if the PDF of the fading distribution is compactly supported and continuous, 

Tequal / r>\ 

lim A° pt (5) = lim T )"> = -. (34) 

Proof: See Appendix IVII1 ■ 

Since the boundary -relaxed scheduler is optimal as B — > oo, the scheduling gain of the optimal scheduler and that 
of the boundary -relaxed scheduler are the same as B — > oo; the same is true for the optimal and the one-shot 
scheduler as B — > 0. The plot of scheduling gain vs. B in Fig. [6] agrees with the results of Theorem [6] Intuitively, 
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Fig. 6: Scheduling gain A5 when g is a truncated exponential variable with support [0.001, 10 6 ] 



TABLE I: Scheduling gain examples for several fading distributions 



distribution of channel state g 


T = 5 


T= 10 


T= 50 


lim Af'(B} 

B — >oo 


lim AT(B) 

B^O b 


lim A£(Jj; 

B — >oo 


lim A£(fl) 

B^O u 


Km A-(B) 

B — >oo 




truncated exponential with supp. [0.1, 10 b ] 


0.97 dB 


4.42 dB 


1.26 dB 


5.98 dB 


1.63 dB 


8.59 dB 


truncated exponential with supp. [0.01, 10°] 


2.19 dB 


6.72 dB 


2.80 dB 


8.63 dB 


3.52 dB 


11.51 dB 


truncated exponential with supp. [0.001, 10 b ] 


3.38 dB 


8.38 dB 


4.22 dB 


10.44 dB 


5.17 dB 


13.40 dB 



scheduling delivers a larger power gain for small B because in such scenarios one can be very opportunistic and 
transmit the entire packet once a sufficiently good channel state is realized. For larger B, however, it is inefficient 
to transmit the entire packet in a single slot (because energy increases exponentially with the number of bits) and 
thus transmissions must be spread across many slots (in fact, all slots are used as B — > 00), which reduces the 
channel quality during those transmissions and thus reduces the benefit of scheduling. 

In Table U the limited scheduling gains are showed for various fading distributions. As intuitively expected, the 
scheduling gain is larger for more severe fading distributions and for larger time horizons T. From the fact that both 
G{vt, ■ • • and wr+i decrease as T increases [2], the asymptotic scheduling gains in d33l and (l34l ) increase 
with T. 

A. Large B Behavior (High SNR) 

When B is large relative to T, it is useful to interpret the scheduling gain in terms of the well-known affine 
approximation to high-SNR (P) capacity [3]: C(P) = 5oo(logP — C^) + o(l), where denotes the slope 
representing the multiplexing gain and denotes the constant term representing the power/rate offset. We define 
the average SNR on a per-slot basis, i.e., P = Jt/T. Similarly, the average rate is defined as Rt = B/T, which 
represents the average spectral efficiency per slot. Then, we investigate Rt in terms of P and T: 

R T (P) =S 00 (logP-C 00}T ) + o(l). (35) 

With algebraic calculations, we can obtain Soo and £qo,t for the equal-bit policy, the optimal scheduler (which 
is equal to the boundary -relaxed scheduler in this regimd_|), as wei l as the ergodic capacity (see Appendix I VIII I for 




6 We obtain this result in the process of proving Theorem [2] and thus we limit the fading distribution as conditioned in Theorem [2] i.e., 
the PDF / is compactly supported and is continuous on the support. 
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Fig. 7: £oo,t f° r several fading distributions 



derivation). The three policies have the same multiplexing gain (degrees of freedom) per slot (<S 
offsets Coo t are different: 



equal 

oo,T 

/-opt 

*-oo,T 

/-erg 



log Ui, 

logG(l/ T ,U T -l, 



1), but the 

(36) 
(37) 
(38) 



The offset of the equal-bit scheduler is independent of T since it does not take advantage of time diversity. On the 
other hand, the offset of the boundary-relaxed scheduler decreases with T since G (^t> fT-i> • • • >fi) decreases 
[2]. Moreover, the offset of the boundary -relaxed scheduler converges to that of the ergodic capacity because 
G {yr, vt-I) • ■ • > vi) — * ^oo as T — > oo [2]. Figure |7] illustrates the offsets £oo,T for several fading distributions. 
As can be seen, C ^^ for all the fading distributions decreases from C^'^ as T increases and converges to C^rp. 

' oo,i & oc,i & oo,J 

We can also see that the offsets £oo,T have larger values for more severe fading distributions. 

Figure [8] illustrates the behavior of the spectral efficiency versus SNR. The dashed lines are obtained from the 
affine approximations in 051 ) while the solid lines are obtained numerically by running the optimal scheduling 
policy. As can be seen, the affine approximations are very accurate when SNR is 20dB or higher. Furthermore, 
as T increases the spectral efficiency increases from the delay-limited capacity (achieved with the T = 1 optimal 
scheduling or the equal-bit scheduling) to the ergodic capacity (achieved with the T = oo optimal scheduling). 

It is interesting to note that for the dual problem of rate maximization over a finite time horizon when subject 
to a per-realization energy constraint (i.e., for every realization of channel gains gr,- " >Si the amount of energy 
used by the scheduling policy cannot exceed some constraint E considered in [6] and [7], at high SNR the optimal 
policy converges to uniform power allocation (independent of channel state) and there is no advantage to using an 
intelligent scheduling policy. This is to be contrasted with the setting considered here, where there is a non- vanishing 
benefit to using the optimal scheduler even at high SNR (i.e., large B). 



B. Small B Behavior (Low SNR) 

In this regime, we characterize the linear approximation to the spectral efficiency versus ^ curve based on 
the wideband analysis in [4]. The linear approximation consists of a constant term ( jf- ) and a slope Sq that 

V / min 

represent the minimum energy per bit for reliable communication and the growth of spectral efficiency with respect 
to jk. To be clear, we adopt the notion of Ef, as the required energy per slot to transmit one bit per slot instead 
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of the required energy to transmit one bit throughout the entire T slots: 



Rr ( ^ ) ~ S ( >:i 



' N 



dB 



min,T 



dB 



V 



3dB 



(39) 



/ 



These parameters <Sq,t and ( 



min,T 



can be obtained for the equal-bit scheduler and the one-shot scheduler, 



which is optimal for B — » 0, (see Appendix HXl for derivations): 

'M eq 



°0,T 



cone 
°0,T 



_2 

T' 



min,T 
one 



and both and f 



N J 

N 0/ m in,T 
erg 

min,T 



flog 2)1/! , 



1 



mm | — , E 

9T 



(log2)E 
are zero for ergodic capacity. 



mm 



9T-1 



E 



mm 



92 



1 

91 



Figure [9] illustrates the behavior of ( jr 



min,T 



and 5o,t with respect to T. As can be seen, both ( 



(40) 
(41) 

and 



min,T 



5o,t decrease from the delay-limited values to the ergodic capacity values as T — > oo due to the available time 
diversity. 



VI. Conclusion 

We have shown the asymptotic optimality of three different scheduling policies for delay-constrained transmission 
over a fading channel. When only a small number of bits need to be served, a one-shot threshold policy is optimal: 
once a sufficiently good channel state is experienced, the entire packet is transmitted. On the other hand, when the 
number of bits is large, the number of transmitted bits at each time step should be a weighted sum of the unserved 
bits and a channel state-related term, where the weight is proportional to the time to deadline. In each of these two 
policies, the scheduler is opportunistic while also being cognizant of the deadline. Furthermore, a modification of 
the ergodic waterfilling policy is shown to be optimal when the number of bits and the time horizon are both large. 

Although problems involving delay-limited communication are of great practical importance and have been the 
subject of considerable research, such problems generally do not have closed-form solutions. In this work, however, 
we are able to circumvent this general difficulty by considering different asymptotic regimes. It would be interesting 
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Fig. 9: Low SNR behavior when g is a truncated exponential variable with support [0.001, 10 6 ] 



to see if the asymptotically optimal policies identified here, which admit a very simple analytical form, can be 
extended to other more general settings. For example, to scheduling with time-varying channels and randomly 
arriving packets [15] [16] and possibly to multi-user channels [17]. 



Appendix I 
Proof of Proposition Q] 

(a) (i) We show the monotonicity of 6° pt in fit- When gt < 1 



and thus is non-decreasing in 



j3 t . When 



(j^O'Cft) < 9t < GS)'(o)' 
and ( J^iYiPt — ^° pt ) increases. As a result, 6° pt increases but this leads a contradiction. Thus, b° pt is non- 



-, we suppose that 6° pt decreases in (3 t - Then, ((3 t — b° pt ) increases 



decreasing in (3 t when rpryT^) < 9t < Wnen ft ^ 

in /3 f . 

(ii) We show the strict monotonicity of b° pt for large fit- To do this, we first show the unboundedness of 
{J° P \Y- Suppose not, i.e., there exists M(< oo) such that (J°Z\)'(/3) < M for all (3(> 0). By integrating 
both sides, we have J^ X (J3) < M(3 for all /3{> 0). Note also that J^l\(/3) > E 



, 6° pt = /?t and thus is non-decreasing 



e /3/t_l 



max(s t _i,- 



for all 

max(g t _i,- 

< M/3 for all /?(> 0), which leads a contradiction. 



e/V*_i 



/?(> 0) by ©. Consequently, we have E 
Therefore, (J° P i)' is unbounded. 

Since (J°^i)' is unbounded and monotonically increasing, for any given g t there exists Q3o such that 



, 7 o P , w . < gt < , ppf Wo n f or a H A > 2$o- I n this region of <ft, we showed that rj° pt is non-decreasing in (3 t 

by (i). Suppose 6° pt maintains a constant value as (3 t increases in this region of g t . Then, (/5 t — 6° pt ) increases 
and ( J° P \)'(A — b° pt ) increases. As a result, b° pt increases but this leads a contradiction, too. Therefore, 6° pt 
increases strictly in (3 t if Pt > 33o- 
(iii) Finally, we show the monotonicity of Q9t — b° pt ) in /3 4 . Since rj° pt is non-decreasing in 0t, is non- 
decreasing. Since («/°Zi)'Q is an increasing function, (fit — 6° pt ) must be non-decreasing in /3 t . If /3f > 2$o, 
then rj° pt is strictly increasing by (ii) and thus (j3 t — b° pi ) is strictly increasing by the same argument. 

, b° pt is constant as g t increases, and thus is non-decreasing with respect 



(b) if gt < xJ^yW) and at > 

to gt- When 1 



(JZxYW <9t< (J^i)'(o)' 



(42) 
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If we suppose that b° pt decreases strictly as gt increases, (fit — 6° pt ) will increase and thus (J°^\)'(f3t — b° pt ) 
will also increase. This leads a contradiction because the left hand side of (l42l ) decreases strictly while the 
right hand side increases. Therefore, 6° pt is non-decreasing in g t . 



Appendix II 
Proof of Theorem \T\ 

We show the result by induction, i.e., we show that if the scheduling functions converge at time step t — 1, 
then the functions also converge at time step t. The base cases occur at t = 1 and t = 2: by construction, 

bf^(f3, g 2 ) = b op \f5, 92) for every ((3, g 2 ), and 6f> *(/?, 9l ) = b op \P, 9l ) for every ((3, 9l ). 
In order to show policy convergence, it is useful to write 6f lax as: 

l 



0, gt < 



(tf«-0'(&)' 



bf<*(f3 u9t ) = arg b {g = (Ut-iYiPt ~ b)} , w ^ m <g t < w ^ m , (43) 



Pu g t > 



(tf«-0'(o)' 

which is identical to the expression for 6° pt in ([5]) except replacing (J^i)' with (Ut-i)' ■ Since J° pt x and Ut-i 
are convex, (J?^)' and (Ut-i)' are increasing and moreover unbounded (shown in Appendix U (a)(ii)). Since 
5min > and g mSLX < oo where g m i n and (? ma x are the lower and the upper bounds of the support of the PDF / 
(Support(/) = [<7 m i n , <?max])> there exists *Bo such that if [3 > 2$o then 

9 ™'" > max [w^wr mm) (44) 
9 »" < min {wjwy mm) ■ <45) 

Henceforth, we only consider > 5So> an(4 - thus, no truncation occurs in both policy functions, i.e., 6f lax and 6° pt 
are determined by 

— = HP-bfn, (46) 

gt 

— = i>(f3-bT), (47) 

for [3 > <B , where 0(/3) = (U t _ x )'((3) and V(/3) = ( J° pt i)'(/5)- 

Let e > be given. By Lemma Q] (stated later in Appendix HIT), there exists 2$i(> 2$o) such that if £ > 53i, then 

10(0 — ^(01 < e. (48) 

Since /3 — 6° pt and 6° pt are strictly increasing in (3 (when (3 is sufficiently large) by Proposition Q] and (3 — b° pt (= 
, ip~ 1 (e bt / gt)) is unbounded due to the unboundedness and the monotonicity of ip, there exists 2$2 such that (3 > 
implies 

(3 - b opt (P, g t ) > »i, V 5t G [ 5min , 5max ] . (49) 

Therefore, if /? > «8 2 , 

|0(/3 - 6° Pt (/3>^)) " W " &r(A<ft))| < C V 5t G [ 5 min, <7max]. (50) 

If 6° pt < 6f lax , 

009 - &r' ax ) < HP ~ bf) < W ~ bT) + e, (51) 
where the last inequality follows from d50l ). Additionally, we have 

009 " bf m ) = — > — =^(f3- bT)- (52) 

At gt 



16 



Combining ED and (g2]>, we have <f>(fi - bf aK ) - i/>(p - 6° pt ) < e. By the same argument for b° pt > 6f lax , we have 
V>(/3 - 6° pt ) - HP ~ bf™) < e. Thus, we obtain 



\<t>((3 - bf**(/3,g t )) - ^{(5 - bf(/3,g t ))\ < e, Vg t e [ 5min , Smax]- 
By (l46l) . (|47"T ). and the continuity, &f lax (-,<?t) — ► b° pt (-,gt) uniformly on [<7 m in, Smax] is obtained. 



(53) 



Lemma 1: If (-,g t -i) -> b^-^t-i) uniformly on [# min , c/ max ], then 

lim [(C4_i)'(/3) - (J° pt i)'(/3)] = 0. 

p-n>oo 

Proof: From © and ([5]), we write the expected cost-to-go as: 



(54) 



J t-2> 



J* 2 {p)dF{x) + 



o j £ x 

f"P' vim 



e /3 _ 1 



dF(x) + 



TO 



^3 



e 6 * - 1 



+ J°Z t 2 (/3-C t i) 



dF(x), 



(55) 



where 6°^ is a function of /3 (and x). By differentiating J° pt x using integral calculufl the derivative (with respect 
to 0) of J°!_\ is: 



(Sw 



e 6 - - 1 



dF(x) 



(56) 

Since (J^ 2 )' is unbounded increasing and Support(/) = [g min ,5ma X ], (j^* 1 ),^) < Smin and llo f V( ^ > 5max for 
sufficiently large j3, and thus 



lim 

/3^oo Jo 



U~T 2 )'C/3) / jopt y 



OCT '(/3)/(x)dx = 0, 



1 



lim er \ — f(x)dx = 0. 

(•/*,)'(<)) 



(57) 
(58) 



opt 

"t— 1 



Since^ = (J^ 2 )'(/?-6° p _ t 1 )forx€^ 1 



by ©, 



dp 



+ ^ t 2 (/5-C t i) 



e 6 - - 1 



e °i-i 



As a result, the derivative of the expected cost-to-go can be stated simply in the limit of large (3: 

lim (J° P i)'(/3) = lim / ( ^ W dF(x). 



(59) 



(60) 



From (|7]), we have 



lim 

/3->oo 



lim 

/3->oo 



lim 

/3-xx 



_t X 

.0 e/3 



. _________ . ____________ trelax l°P' 



e'- 1 G(ft-i,-- - >^i) 



dF(x) 



(J t T 2 )'</3) 



(61) 



tf'(x) = h(x,<p{xW(x) + /; W §&(-,»)__ for H(x) = /; w 
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Since Support(/) = [g min ,g max ] C ^vy^y (f^yjp) 
converges to b°*\(f3,x) uniformly on x £ [ff m in,5max], 



(for large (3) and the induction hypothesis that bf_l(f3,x) 



lim 



/.relax LOpt 

to) e°*-i - e *- 1 



f{x)dx = lim 

/3— ♦oo 



Lrelax i,°P' 
= °t-l — e *-l 



X 



f(x)dx = 0, 



(62) 



and thus 



lim [(tUn/?) - (J£)'(/3)] = lim 

fl-^oo fi—l-OO 



e*- 1 Gl^t-iv 



(^ p _! 2 \'03) 



; .relax 



-dF(x) 



(63) 



By substituting (TTTT) into fe^f and re-writing G(ft-i, ■ • ■ , ^l) as 

G(ft-i, ■ • • , v\) = (G(^-2, • • • , ^i))^ 
we have lim^ [(U t -i)'(P) ~ (^'i )'(/?)] = 0. 



dF(x), 



(64) 



Appendix III 
Proof of Theorem [2] 

We will prove this by showing that limB-foo [J* Ux (B) - U T {B)] = and lims-nsc [U T (B) - J^ P \B)] = 0. 
(i) First, we show lim [J^ elax (B) - U T {B)] = by induction. Notice that Jf lax (/3,g) > U t (fi,g) for all values 

B — KXJ 

of p and g by the constructions © and <22), and thus, |J t relax (/3, 5) - C/t(As)| = Jf^{l3,g) - U t (p,g) and 
I Jf lax (/3) - U t {(3)[ = Jf lax (/3) - U t (P). 

By (0 and ((22l . J{ elax = J7i. Let e(> 0) be given. As an induction hypothesis, we assume that 



lim [J^\(B)-U T -i(B)] = 0, 

B — >oo 



(65) 



i.e., there exists <B such that if B > *B then J^\(B) - U T -i(B) < e. 

To differentiate 6f lax and the solution to ©, we let &™ truncated be the solution to ©, and thus the relation of 

the two is bf°*((3 t ,g t ) = (b^^i^gt))^ where &— cated (A, g t ) = \(3 t + log (jfe) by ©. 
Notice that 

J? lax (B) - £7 T (B) = E [jT elax (B,<?T) - UT(B,g T )] 

1 



When 



E 



+ E 



+ E 



1 



J^(B,g T )-U T (B,g T ) 
J^{B,g T )-U T {B,g T ) 
J^{B,g T )-U T {B,g T ) 



(Ut-iY(B) 
1 

9t < 
9t > 



< 9t < 



(Ut-i)'(B) 
e B 

(Ut-i)'(O) 



(Uir-i)'(O) 
Pr{ g T < 



Pr 
1 



(Ut-i)'(B) 



< 9t < 



(C/t-i)'(0) 



Pr J g T > 



(Ut-iY(B) 
(Ut-i)'(O) 



(66) 



(Ut-i)'(B) 



< g T < (u T l*Y(py b f ax ( B ^9T) = b^ tmncated (B,g T ) by <@3]>, i.e., no boundary cases occur, 



jf™{B,g T ) - U T (B,g T ) = j$$(B - bf) - U T -i(B - 6£ lax ), 
where bf m = ±£ + 2^1 l og 5T G(^t-i, ■ ■ ■ , v\) by CD for 1 



(U T -i)'(B)> (Ut-i)'(O) 



Thus, 



B - 6? lax = ^£ - log 5t G(^t-i, • • • , ^1), 



(67) 



(68) 
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which is strictly increasing in B and unbounded. Therefore, there exists <Bi such that B > *Bi implies 
B - bf ax > <B uniformly for all g T G [g min , g max \. Thus, Jf™ x {B - 6£ lax ) - U T -i(B - bf M ) < e for 
B > 55 1 uniformly for all £ [dmm, Smax] by (|65l ) and consequently, 



lim E 



J? lax (S )5T )-C/ r (S, 5r ) 



1 



(^T-l)'(S) 



< 9t < 



(^T-l)'(O) 



Pr 



(^T-l)'(B) 



< 9t < 



(^r-i)'(O) 



lim 



lim 
lim 



(£/ T -l)'(0) 



[J^ lax (i?,x)-C/ T (i?,x)] /(s)cfc 



(C/ T -l)'< B ) 

^ [J^ elax 0B,x) - l/r(B,x)] /(x)da: 



[Jt_ x (£ - 6? lax ) - I7 r -i(5 - 6? lax )] /(x)ds = 



For sufficiently large 5, ( ^ t _^, (b) < Smin since (j7 r -i)'(5) = e B / T Q(vt-\, ■■■ and ( ^^ )<(0) 
and thus 



Pr {g T < 



1 



(tf T -i)'(0) 



(Z7 T _!)'(B) 

Consequently, the induction follows, 
(ii) Second we show lim_B_ i . 0C [Ut{B) — J^ pt (-B)] = by induction again. 
At t = 1, all the bits are to be served and thus: 



(69) 
(70) 



3^-1 



ePu 1 -vi = U 1 (0), V/3(>0), 



where z^i is defined in ([8]). As an induction hypothesis, we assume that 

lim [Ut-M-JFM =0. 
From (0]) and ®, we write the expected cost-to-go as: 



o 



(JJ=1, ' W JTMf(x)dx+ 



f(x)dx+ 



where b° pt is a function of /3 (and x). Since (J^i)' is unbounded increasing and Support(/) = [p m m)5n 
r/gpryTfl) < 5min and rpryp > ^max for sufficiently large (3 as did in Lemma Q] and thus 



(71) 
(72) 

dF(x) 
(73) 



lim 

/3^oo Jo 



TJf ^ J° p _\((3)f(x)dx = 



lim 



/(x)dx = 0. 



From Theorem [T] and the induction hypothesis (1721 ). 



lim [&f lax (/?,<?) - 6f (fts)] = uniformly V<? G b min ,S m ax], 

lim [^_! 03 - 6f lax (/3, 5 )) - J° p _\ (/3 - b° pt ((3, <?))] = uniformly V<? G [ 5min , <? n 

/3 — >oo 



and thus, 



lim J° pt (/3) = lim 

/3—>oo /3—*oo 



(J7-1>'(°) 



1 



+ [/ t _ 1 (/3-6f lax ) 



dJf(x) 



(74) 
(75) 

(76) 
(77) 

(78) 
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Therefore, 



lim [U t {p) - Jt P \P)} = lim 



te* G,{v t ,vt-i,-~ , v i) -tv\ 



Lrelax 

e°* — 1 



+?/ t _i( / 9-&r tt ) 



dF(x) 



lim 



te* G(^,f i _i,-- - ,^i) 



opl 



Lrelax 

e * — 1 



+ (t-l)e *-i G(^-i,-- - 



dF(x) 



(79) 



By substituting (fTTT ) into 6£ elax , we have lim / 3^ 00 [C/t(/?) — J° pt (/3)] = as desired. Thus, the induction holds. 
By (i) and (ii), we obtain the result as desired. 



Appendix IV 
Proof of Theorem [3] 

First, we show that the optimal scheduler becomes a threshold policy as (3 — > 0. As in ©, the optimal policy 
is determined differently by the range of g t : g t < 1/{J° P \ )'(/?), l/(J°l\)'((3) < g t < e /3 /( J°!_\)'(0), or g t > 
^/(^l)'(0). Since lim^ ^ = 1 and lim^oC^-i)'^) = (^i)'(O), 



lim 

13^0 



1 



(^)'(o) (JTiVCS) 



o. 



(80) 



which implies that the case of l/(J° pt 1 )'(/?) < gt < ^/(Jt-i)'(^) occurs with vanishing probability as /? — > 0. 
Thus, the optimal policy is a threshold policy , i.e., 



(3, gt > 
0, 9t< 



(j^i)'(O)' 

l 

(jj?i)'(0) 



as /3 — > 0. This implies that 

lim sup{ 5 : b?(J3,g) = 0} = lim inf{ 5 : b° p \(3,g) = 0} 



(^ pt i)'(o)' 



(81) 



(82) 



Second, we show that the thresholds are identical, i.e., lim / g^o(^f° P i) / (/3) = for every t, where ui t is defined 
in (TT6T ). When t = 2, this holds by construction. As an induction hypothesis, we suppose that lim^o^i^)' (/?) = 
(J° pt 2 )'(0) = ujt-i. By (T56]) and ([59]), 



lim(J, opt 1 )'(/3) = lim 



(^ pt 2 )'(o) 



' (^? 2 )'G9)dF(x) + 
i 

fOO J 



topt 



—dF(x) + / dF(x) 

e£ X ' / 1 X 



(J t T 2 >'(« 



dF(x) + 



i x 



cLF(x) 



(83) 



(J° pt 2 )'(0)Pr \g t < 



1 



,"pl 



+ E 



9t > 



1 



(am 



(^ pt 2 )'(o) 



where the last equality follows from (fT6l ) by substituting cj t _i into (J° p 2 )'(^) fr° m tne induction hypothesis. Thus, 
the induction holds. 
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Appendix V 
Proof of Theorem @] 



Since Jf ne (B) = (e B - 1)E | , (Ji ne )'(°) = E 

(Jr e 2 )'(°) = then from CI 



(jri)'(B) 



Thus, 



J- e 2 (S)dF(x) + 

(jr 2 )'(i?)^(x) + 
i 



cj 2 by (fl6ll . If we suppose that limB_>o(«7°.!!|)'(- B ) 
dF[x) 



00 e B -1 



1 x 
t-i 



00 e s 



t-1 



dF(x). 



(84) 
(85) 



lim (jn)'(S) = (Jr 2 )'(0) / " iF(x) + / -dF(x) 



1_ X 

t-1 







1 


1 


Pr < 






► +E 




- > w t _i 








5 


5 







(86) 



By induction, unie^o^™ 6 )'^) = wt+i- hi the proof of Theorem [3l we have shown that liniB^o(<^r Pt ) / (^) 
ujt+i also. Since lim^o J™ e {B) = limg^o J^\B) = 0, by L'Hopital's rule, we have 



By definition, 



B-o j° pt (£) b^o (j^ pt y(B) 

Appendix VI 
Proof of Theorem [5] 

E**(b) < ^JfibT) < Ij™ nstrained - er g(6T). 



(87) 



Let e > be given. Since E txg {b) is an increasing continuous function of b, there exists 5 > such that 

&**{b) + e = E erg (b + 5). 
We use this 5 for 6^- delta . Then, by and gg), 



(88) 



(89) 



constrained-erg 



1 



(bT) < — E 
T-1 



E + 



t=2 



51 



r 



■ E 



T 



1 e 6f 8 (6+«5,g,) _ ^ 



i=2 



<7f 



1 

T 





[e^ 1 -r 




Eft 






. 51 _ 





(90) 



Notice that { 6 - ( + ' at> — are i.i.d. and thus, 



E 



t=2 



T h <* 



9t 



E 



e bf g (b+S,g t ) _ 1 

9t 



E ei z(b + 5). 



(91) 



Since {b e t rg }J =2 are i.i.d., ^-j- ^^=2 ^f S ~^ ^[bf g ] = b + 5 almost surely (a.s.) as T — > 00 by the law of large 
number, and thus, the remaining bits at the final slot is given by 



&T-5>-" = (T-l) 



t=2 



-b-—Yb? g ) 

1 r- ^ ' 

t=2 / 



<6 + 5 a.s. 



(92) 
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That is, e 131 < e b+s a.s. and therefore E [e A ] < e b+s . 



E 5l 
Thus, 



IE/3, 



91 



e b+s _ 1 



9i 



mm Eo, 
&=f>+<5 



e" - 1 



< min E 



91 \ E[b]=b+8 



e b - 1 
ffi J 



E eIg (b + S) (93) 



lim —J,' 

r^oo 



constrained-erg 



(bT) < lim 

T — >oo 



?Li^(5 + 5) + 1^(6 + 5) 



£ erg (6 + 5). 



Therefore, 



E^ib) < lim -J" 



constrained-erg 



(bT) < E"S(b)+e. 



Since e is arbitrary, we have the result. 



(94) 
(95) 



Appendix VII 
Proof of Theorem [6] 



First, we show the monotonicity of A^\B). Since 

—A opt (B) = ( J t)'(B)JT(B) ~ J C T \B)(J^)'(B) 
dB T[ ' (JT(B)) 2 

we will investigate the quantity (J^)'(B)J^\B) - J^(B)(J° pt )' (B). 
From ©, ©, and (|3TT >. we have 



(96) 



J°/(B) 



J opt _ l (B)dF(x) + 
dF(x) 



(4 P L !>'(") 



e br P ' _ 1 



+ J°^_ l {B-bf) 



dF(x) 



e B -l 



(97) 



(J° pt )'(i?) 



dF(x) 



(4 p L!)'(o) _d_ 



dF(x) 



..pi 



J^(B) = T(eT - 



(JJP)'(B) 
When T = 2, 



(98) 

(99) 
(100) 



(J^(B)jr(B)-r 2 \B)(jrY(B)= / " e f^ 2 (-l)(^-l) 2 /(x)^ 

JO 



+ / „ e-^i(-l) 



1\» f 
x) \vi 



f(x)dx 



(101) 



e 2 



.r 



(-l)(eT - l) 2 f(x)dx 



< 0. 



That is, — jg— 1 - < 0. 

We now suppose that "" T rf "^^ ; < and examine ""J^"" -1 . That is, we assume that 

(j^y (£)4 p -i(^) - j^xj^)'^) < o, 



(102) 
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where the left hand side is the numerator of — ^ — from d%l ). The numerator of — ' is given by 

(J^)'{B)J° T pt (B) - 4 q (B)(J° p ')'(B) 

= y <J ^-i ),(B) i/j [e# j^B) - T(e* - 1)(Jt-i)'(^)] f(x)dx 



+ 



+ 



e#^_±-T(e*-l)^- 



f(x)dx 



e 6 ?' - 1 



+ j^fl - ^slj - r( e T - 1 



d [ e b T - 1 



dB \ x 



f(x)dx 
(103) 



From the integrand of the first integral in ( 11031 ), 

e*J?_ x {B) - T(e# - l)^)'^) = e"^> [ e 3^ j^B) - (T - l)(e^ _ 1)(J^ 1 )'(B) 



+ 



T-e~ -{T-l)e~^^ (^^'(B) (104) 



From (j99l), (1 1 00b . and the hypothesis (11021 ). we have 

e£*J?_ x tB) - (T - l)(e^r _ 1)(J°>* < 0. 

We define a concave function <fi such that 

cj){z) = T - z T - 1 -(T-l)z- 1 , z>0. 



(105) 



(106) 



Since the concavity and 4>'{z) = yield that <ft attains its maximum at z = 1 and (f)(1) = 0, <p(z) < for all z > 0. 
Since ^(e T < T - 1 ) ) = T - e~ - (T - l)e T < T - 1 ' , 



T — e~ — (T — l)e n^-i) < o. 
From (11051 ) and (11071 ) along with the fact that Jj?_ x is convex, (11041 ) becomes 

ef Jg_ x {B) - T(e# - 1)( ^/(B) < 0. 
Likewise, from the integrand of the second integral in (11031 ). we want to show that 



eT( e B 



1) -T(e~ - l)e B < 



(107) 



(108) 



(109) 



which is equivalent to show e B — 1 — T(l — e r )e B < 0. If we define a concave function tf>(z) = — (T — l)z T + 
Tz T ~ l — 1 for z > 0, ^(e?) = e B — 1 — T(l — e~~)e B . As we did before, we can show that ip(z) < 0, and thus 
(fT09l holds. 

From the integrand of the third integral in (1103b . we want to show that 



e^ ( e b ^ - 1 ) - T ( e^ - 1 ) < 0, 



ef 4^(B - bf_ x ) - T (e# - ij ^4 P 'i(^ " &£i) < 0. 



(110) 
(111) 



To prove (111 0b . we can write 



e t 



( b° pl 

( e° T 



J ) - :/ (ef -lje 6 * 

B-b 0pl 



e r 



6° pt 



e—^Te bT I 1 - e" 



Notice that d 109b holds for every B > and thus 



opt 



e — ( e "T _ i) _ T( e — - l)e 6r < 0. 



(112) 



(113) 
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Therefore, dl 10b holds. By the similar argument, (111 II ) holds, too. Thus, the third integral in d 1031 ) is no greater 
than 0. Consequently, we obtain 

(J^)'(B)J° pt (B) - J^(B)(J^)'(B) < 0. (114) 

This shows that the monotonicity of Ay (B) inductively. 

Second, the limits are calculated with J^ lax (P) and J™ e (£>) since Jj? l (B) converges to the former for large B 
and to the latter for small B by Theorem [2] and Theorem |4] respectively. 



Appendix VIII 

Derivation of High SNR Affine Approximation Parameters 
From d3l"l ), the spectral efficiency of the equal-bit scheduler can be found by 



Thus, at high SNR 



P = { e R T - l) Vl or R^{P) = log M + V 



R^{P) = logP-logi/ 1 +o(l). 



Similarly, from (|7]) and Theorem the spectral efficiency of the optimal scheduler at high SNR is given by 

R°v\P)= log P-\ogG(v T ,--- ,vi) + o{l). 

At high SNR, the ergodic capacity can be approximately given by the uniform power control: 

R^(P)^E[log(l+gP)] 
~logP + E[log<?] 



log P — log e 

lOg P - log U n 



where the last equality follows from 

e E N(i)] = 



lim e 



lim E 

m— >oo \ 



(115) 
(116) 

1 

(117) 



(118) 



(119) 



Appendix IX 

Derivation of Low SNR Affine Approximation Parameters 



From (f!T5l 



By [4], 



Then second order analysis is given by 



Therefore, by [4], 



P eq (P) 



i + 



log 2 
P e i(0) 



(log 2)i/i. 



(im; 



^0 



P ec i(0) 



2. 



i? e i(0) 

By Theorem |U the average total energy cost of the optimal scheduler at low SNR is given by 



3f = (e B -l) E 



1 

mm | — , E 

9T 



mm 



9T-1 



-,---E 



mm | — , E 

92 



1 

91 



(120) 



(121) 



(122) 



(123) 



(124) 
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With the per slot basis notations, 



TP = {e TR *-l)K 



mm 



9T 



-,E 



mm 



9T-1 



and thus, 



#*(P) = -log| I.+ 



•E 



TP 



mm 



92 



1 

91 



E 



mm 



9i 


) 


) 


) 



(125) 



(126) 



Therefore, we have 



i? opt (P) 



R opt (P) 



1 + 



TP 



E [ mi n(^,E[min(^,...E[min(i,E[^j)j)j)j 

t; , 

(4 mm (i4 mm (^--4 min (i4d)])])]) ; 



(127) 



(128) 



Thus, 



Eb 
N 



log 2 
i?(0) 



cone 
^0 



-2 



l 1 + 

(log 2) E 

2 



TP 



E [min(^,E[min(^,..E[min(i,E[^j)j)j)j 



. / 1 










mm — ,E 


min 




••E 


min 


\9T 




\9t-i' 







92 



-,E 



1 

.91 



i2 one (0) 



2 



i? one (0) 

See [4] for (jjr) and <S>o of the ergodic capacity. 
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